Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
Ar | 4782 | 266 | 3 | 88.6667 |
voe | 12475 | 353 | 8 | 44.1250 |
reas | 5303 | 122 | 3 | 40.6667 |
An | 3137 | 188 | 5 | 37.6000 |
oa | 18479 | 316 | 10 | 31.6000 |
Ne | 1177 | 62 | 2 | 31.0000 |
vezañ | 2817 | 121 | 4 | 30.2500 |
vez | 8075 | 272 | 11 | 24.7273 |
ra | 5020 | 111 | 5 | 22.2000 |
Ma | 296 | 16 | 1 | 16.0000 |
kaver | 506 | 24 | 2 | 12.0000 |
veze | 2348 | 105 | 9 | 11.6667 |
Eil | 144 | 11 | 1 | 11.0000 |
zouez | 327 | 11 | 1 | 11.0000 |
reont | 609 | 32 | 3 | 10.6667 |
gasas | 105 | 10 | 1 | 10.0000 |
rejont | 309 | 20 | 2 | 10.0000 |
Du | 363 | 28 | 3 | 9.3333 |
Here | 335 | 28 | 3 | 9.3333 |
bet | 11522 | 349 | 38 | 9.1842 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
meur | 2226 | 1 | 70 | 0.0143 |
mestr | 137 | 1 | 9 | 0.1111 |
goloet | 91 | 1 | 8 | 0.1250 |
c’hoant | 68 | 1 | 8 | 0.1250 |
feur-emglev | 51 | 1 | 7 | 0.1429 |
harpet | 76 | 1 | 7 | 0.1429 |
enebet | 70 | 1 | 7 | 0.1429 |
le | 122 | 1 | 7 | 0.1429 |
n'eo | 914 | 4 | 26 | 0.1538 |
ez | 4339 | 34 | 215 | 0.1581 |
gambr | 60 | 1 | 6 | 0.1667 |
American | 51 | 1 | 6 | 0.1667 |
rumm | 49 | 1 | 6 | 0.1667 |
dieubidigezh | 74 | 1 | 6 | 0.1667 |
miliadoù | 65 | 1 | 6 | 0.1667 |
meret | 43 | 1 | 6 | 0.1667 |
dizalc'hiezh | 40 | 1 | 6 | 0.1667 |
vered | 54 | 1 | 6 | 0.1667 |
m’eo | 52 | 1 | 6 | 0.1667 |
gostezenn | 74 | 1 | 6 | 0.1667 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II